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Abstract 

We consider nonlinear optimal control problems (OCPs) for which all problem 
data are polynomial. In the first part of the paper, we review how occupation 
measures can be used to approximate pointwise the optimal value function of a 
given OCP, using a hierarchy of linear matrix inequality (LMI) relaxations. In the 
second part, we extend the methodology to approximate the optimal value function 
on a given set and we use such a function to constructively and computationally 
derive an almost optimal control law. Numerical examples show the effectiveness of 
the approach. 



1 Introduction 



It is well known that solving an optimal control problem (OCP) can be a very hard task 
notwithstanding the power of theoretical tools such Pontryagin's minimum principle and 
Hamilton- Jacobi-Bellman optimality condition. This statement is particularly true when 
dealing with state and input constraints. 



Contribution. In this paper we consider the class of OCPs for which all problem data are 
polynomial. The approach we deploy (which was introduced in [4]) is based on moment 
theory and consists in deriving a hierarchy of convex linear matrix inequality (LMI) 
relaxations of the OCP which give an increasing sequence of lower bounds on the optimal 
value. These LMI problems can be solved using off-the-shelf semidefmite programming 
(SDP) solvers. 

*The authors are with LAAS-CNRS, University of Toulouse, France. D. Henrion is also with 
the Faculty of Electrical Engineering, Czech Technical University in Prague, Czech Republic. J. B. 
Lasserre is also with the Institute of Mathematics of the University of Toulouse, France. E-mail: 
{henrion , lasserre , savorgnan}@laas . f r 
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The contribution with respect to [4] and its extended version [5] is twofold. First, the 
derivation of the relaxation is obtained in a simpler way, starting from basic concepts. 
The second and more important contribution is that we show how the methodology can 
be applied to approximate the optimal value function on a set and to derive constructively 
and computationally a control law. The approach is illustrated on a few simple examples. 

Notation. R and N denote respectively the sets of real and integer numbers. M.[y] = 
[yi, . . . , y n ] denotes the ring of polynomials in the variable y. M[y]d = [yi, ■ ■ ■ ,y n ] denotes 
the ring of polynomials of degree at most d in the variable y. When y el" and a G N n , 
y a stands for y" 1 . . . y" n . Given a polynomial function ip, deg((^) is the maximal degree of 
its monomials. Given a differentiable function ip(y), V 2/ (</?) = [J^-, . . . , J^-] is its gradient 
with respect to y. 5 yo is the Dirac measure at yo- v' denotes the transpose of v. 



2 Problem definition 

Consider continuous-time systems described by the differential equation 

x(t) = f(t,x(t),u(t)) (1) 

where x G M ra and u G M. m are respectively the state vector and input vector. By defining 
the cost function 

/ h(t,x(t),u(t))dt + H(x(T)) (2) 
Jo 

the initial constraint 

x(0) G Ci = {x : g h (x) < 0, j = 1, . . . , m} 

and the final constraint 

x(T) EC F = {x: g Fj (x) < 0, j = 1, . . . , n F } 

we can formulate several OCPs. E.g., when Ci and Cp contain only one point we have the 
classical problem of driving the system from an assigned initial condition x(0) = x to a 
final condition x(T) = Xt by minimizing a given cost. 

In the sequel, we will consider all the problems which can be cast in this framework with 
the additional constraint on the trajectory (t,x(t),u(t)) G Ct, where 

C T = {(t,x,u) : g Tj (t,x,u) < 0, j = 1, . . . ,n T }. 

An important assumption which is necessary for the derivation of the methodology is that 
all problem data are polynomial. More precisely: 

Assumption 1 The functions f, h, H, g Ip g T . and g F . are polynomial. 
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3 The moment approach to optimal control 



The key idea underlying the moment approach is that of defining three occupation mea- 
sures which convey the information about the initial condition of the system, its trajectory 
and the final condition. The OCP is then rephrased in terms of the moments of such mea- 
sures. The convex problem obtained contains three ingredients: 

• a set of linear equality constraints on the moments which characterize the system 



• a set of semidefinite constraints which come from the fact that the moments belong 
to a measure; 

• a set of semidefinite constraints which translate the constraints induced by Cj, Ct 
and Cf on the supports of the measures. 

To derive this constraints we assume horizon T is fixed. 



3.1 The trajectory constraints 

To obtain the trajectory constraints we start from the idea that the system trajectories 
can be characterized studying how certain test functions evolve along the trajectories. 
For this purpose, we choose functions which are monomials of the form t a x@ . Consider a 
trajectory x(t). Using the fundamental theorem of calculus we can write 



The trajectory constraints are obtained by rephrasing equation (3) in terms of three 
properly defined occupation measures. 

The final occupation measure hf captures the information on the state at time T 



The initial occupation measure /// captures the information on the initial condition of the 
system 



The trajectory occupation measure ht captures the information on the value of t, x(t) and 
u(t) along the trajectory 



dynamics; 




(3) 
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Notice that [ip and /xj are probability measures, their mass is equal to 1. 
Next, if / G u] then d ^ x d * - 1 G M[t,x, u] because 

^f 1 = ^f 1 + V,(^)/(x, «) =^W 

for some coefficients a"£„ that depend on /. The degree of the derivative is deg(a; a t /3 ) — 
1 + deg(/). Using previous equations, (3) can be rephrased as 

T a j x^dfiF = a J x^dfij + J2 a t» J ^x^u u dfi T , (4) 



i.e., a linear relationship between the moments of /xp, Hi and \it- Namely, introducing the 
notation zp — J x^dfiF, wp = f x^dfxi, y lw = f t 1 x v u u dfip, we obtain 

T% = O a wp + o^ v y lw (5) 

for every a, (3 G N x W l . Notice that from (5), the mass of \xt is T. In compact notation, 
consider test functions of degree up to r and the canonical basis of monomials of degree 
at most r: 

(\r-. 2 TV — 1 V 1 / 

Xj [1, X\^ • • • , %m <^l> ^1^2? • • • 5 ^1 • • • 5 ^nJ ' 

Define the vectors z r — J m r (x)dfiF, w r = j m r (x)dfii and y k = j mk(t,x,u)dfj,T- Then, 

A F z r = Ajw r + A T y k (6) 

where k > r — 1 + deg(/) and the coefficients of the matrices Ap, Ai and A T can be 
obtained from equation (5). 

Define the coefficient vectors Ch and c# to be such that 

h(t,x,u) = c' h m,k(x,t,u), H{x) = c' H m r {x). 

Observe that 

/ h(t,x(t),u(t))dt + H(x(T)) = c' h y k + c' H z r , (7) 
i.e., the criterion of the OCP is a linear functional on z r and y k - 

So far, for a given trajectory x(t), we have characterized linear constraints satisfied by the 
moments of the three associated occupation meaures. Now, if the trajectory is unknown, 
the three measures are unknown, and we can consider the abstract linear programming 
(LP) problem J(/x/) = min^ ^ ^ f hdfix + f Hdjip subject to (4), which aims at finding 
the occupation measures associated with the optimal trajectory. The measures /xp, /xj, 
fix are characterized through their respective truncated moment vectors z r , w r , y&, the 
remaining difficulty being finding conditions that ensure that those vectors and indeed 
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moment vectors of measures with respective supports C F , Ci, Ct- This is explained in the 
next section. 



A nice feature of the approach is that we can play with the initial and final measures. For 
instance, if \ij = 5 X0 we retrieve the optimal cost J(S X0 ) of the OCP with fixed initial state 
x . Now, if fij is unknown, but with known support Ci, then J(//j) = mm xoeCl J(8 X0 )- 
Finally, if \ij is known, but not a Dirac, solving the above LP problem aims at computing 
J J(5 X0 )d/j,j(x ). 



3.2 The moment matrix constraints 

There exist linear programming (LP) or semidefinite programming (SDP) necessary and 
sufficient conditions for an infinite vector to be a moment vector, i.e., the vector of mo- 
ments of some finite Borel measure on a compact basic semi-algebraic set; see e.g. [8]. 
We chose the latter since it has shown to be more effective for numerical purposes [3]. 

With r an even number, let 

M(z r ) = J m r / 2 {x)m r / 2 {x)'d^ F 

be the moment matrix of order r associated with /i F . Obviously, M(z r ) is positive semidef- 
inite, denoted M(z r ) y 0. Therefore, in the convex relaxation of the OCP, one imposes 

M(z r ) h 0, (8) 

and similar constraints are imposed on w r and y^. 

3.3 The localizing matrix constraints 

Similarly to the previous subsection, one may express the support constraints induced by 
Ci, Ct and Cf in terms of linear matrix inequalities on z r ,w r and y^. To derive such 
inequalities, define 



d f ^g(g Fj (x)) if deg(g Fj (x)) is even 
deg(g F .(x)) + 1 if deg(g F (x)) is odd 



and the localizing matrix 

Lg F .(z r ) = J g Fj (x)m(, r _ dFi) /2{x)m {r ^ dF] )/ 2 {x)'d^ F . 

The matrix g Fj ■(x)m( r _ dF y2(x) m (r-d F .)/2( x y is positive semidefinite for every value of x 
such that gFj{x) > 0. Hence if /i F is supported on C F then L F .(z r ) y for every j. 
Therefore, in the convex relaxation of the OCP one imposes the semidefinite constraint 

L 9F {z r )yO j = l,...,n F (9) 



and similar semidefinite constraints for w r and y k . 

Further details on moment and localizing matrix constraints can be found in [2]. 
3.4 The convex relaxation 

To construct the convex relaxation of the OCP, let r and k be even numbers such that 

r > deg(H), k > deg(/i), k > r + deg(/). 

In this paper, we will assume that the initial probability measure \ii is known through its 
moments w r . 

The convex relaxation is the following truncated moment problem: 

min c' h y k + c' H z r 

A F z r = A/W r + A T y k 

M(z r )hO, L gFj (z r )hO, Vj = l,...,n F 

M(y k )hO, L gT .(y k )hO, Vj = l,...,n T 

where the notation w r indicates that the moment vector is known. Two important facts 
should be noticed for the moment problem (10): 

• the constraints on the moments correspond to necessary conditions and therefore, 
in general one only obtains a lower bound on the optimal value of OCP; 

• with r > r and k > k, the constraints of the original problem with r and k are a 
subset of the constraints of the problem with f and k. Therefore, increasing the 
value of r and k yields a monotonically nondecreasing sequence of lower bounds on 
the optimal value. 

Remark 1 If the initial measure jij was unknown, we would have to include the additional 
constraints M(w r ) y 0, L gi (w r ) y 0, Vj = l,...,n/ with now w r being an unknown 
moment vector with first entry equal to one. 

Remark 2 One goal of this paper is to derive the convex relaxation of the OCP starting 
from really basic notions. The same optimization problem can be also obtained using as 
a starting point the duality between the Banach space of bounded continuous functions on 
a compact set K and the Banach space of finite signed Borel measures on K 7 as done in 
[4, 5] where the sequence of lower bounds on the optimal value is shown to converge under 
some assumptions on the problem data. The interested reader is referred to these paper 
for further details. 
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4 The dual approach: SOS polynomials 



For the developments of the results in sections 5 and 6 and a better understanding of the 
moment approach to OCP, it is important to look at its dual formulation which has an 
interesting interpretation in terms of SOS polynomials. 

A polynomial p G M[x] of degree 2d is an SOS if p(x) = Y^i=i fi( x ) 2 f° r some (/»)f =1 £ R[x], 
and this implies that p is non-negative. And p is an SOS if and only if there exists a positive 
semidefinite matrix Q such that p(x) = m^x)' Qm^x) . Denote by S[x] the set of SOS 
polynomials and by T\x] r (with r even) the set of SOS polynomials of degree at most r. 
See [6] for more details. 

The SDP dual of (10) is 

max (AiWrYcu, 
Sjho,Qjho 



tit 



- A' T c^ + M*(S) + J2 L * 9Tj = c h (n) 

tit 

a' f c v + m*(q) + Y,l; f Xs j ) = c h 

where the symbol * indicates the adjoint operator, is the dual variable associated with 
the (moment) trajectory constraint, S and Q are the dual variables associated with the 
moment matrix constraints and Sj and Qj are the dual variables associated with the 
localizing matrix constraints. To interpret problem (11) in terms of SOS polynomials, 
define the polynomial function 

(t,x) i — > tp(t,x) = c^m^t^x). 

Explicitating the adjoint operators yields the following problem: 

max (p(0,x(0)) 

tp£R[t,x] r ,s£'E[t,x,u] k ,q£'E[x] r 
Sj&T,[t,x,u\ k _ dT ,q J &T,[x] r _ dF 



dip(x, t) 
dt 



tit 



+ V x ip(x,t)f(t,x,u) + h(t,x,u) = s(t,x,u) + g Tj (t, x, u)sj(t, x, u) 



J'=l 

<p(x,T) - H(x) = -q(x) -^2g Fj (x)qj(x). 

3=1 

(12) 

Consider the right handsides of the first and second constraints in (12). The first one is a 
polynomial non-negative on Ct, while the second one is a polynomial non-positive on Cp. 
In fact, both are Putinar's SOS representations of their respective left-hand-side [8]. As 
a consequence, every feasible solution ip of (12) is such that 

9<P ^ + V x (p(x, t)f(t, x, u) + h(t, x,u)>0 (13) 
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for all (t, x, u) G C T and 

H(x) - <p(T, x)>0 Vx G C F . (14) 

Suppose the OCP has a solution and consider an optimal control law u(t) which generates 
an optimal trajectory x(t). Therefore, the optimal value function is 

<p(t,x(t))= [ h(t,x(t),u(t))dt + H(x(T)). 

Since a solution (p(t,x) G M[t,x] of the optimization problem (12) is differentiable, the 
fundamental theorem of calculus yields 

V^(i(9),fl)/(fl,i(fl),S(fl))<M. 

Combining with the two preceding equations yields 

<p(t, x(t))-<p(t, x(t)) = [H(x(T))—ip(T, x(T))}f , v 

[f^+v^W, 0)/(0, u(e))+h(e, x(6),m)dB]. { ' 

From (13) and (14), both terms in square braces in the right handside of (15) are non- 
negative. Therefore: 



• <^(0, x(0)) — (p(0, x(0)) > and so, as in the moment formulation, one obtains a 
lower bound on the optimal value function. Therefore, the SOS formulation can be 
interpreted as the search of a smooth subsolution of the Hamilton- Jacobi-Bellman 
optimality condition 



mm 

u€U(t,x) 



d<f(x, t) 
dt 



+ V x (p(x, t) + h{t,x,u) 







(U(t,x) being the set of admissible input at x(t) 



x) 



if ip(0, x(0)) — (p(0, x(0)) is small, both terms in the right handside of (15) are small. 
This implies that the integrand of the second term is small all along the trajectory. 



Remark 3 So far we have discussed only OCPs for which the value of T is fixed. The 
moment approach also applies for problems with free terminal time and when the dynamics 
does not depend on t. By Bellman's principle of optimality, the optimal value function 
does not depend on t and so, in this case, the test functions are of the form x 13 and the 
occupation measure \i T is on 1R™ x lR m (instead o/Ixl"x ]R m ). 



We next illustrate the effectiveness of the approach on a simple numerical example which 
also motivates the developments of the next section. 



8 



Example 1 Consider the double integrator 



'xi(t)~ 
x 2 (t) 



1 




~ xl (t)~ 

x 2 (t) 



+ 




1 



u(t) 



with constraints x 2 (t) > —1 and — 1 < u(t) < 1, \ft. Driving in minimum time T(x) any 
initial condition x(0) = x to the origin is an interesting test problem because an analytic 
solution is available [5]. Indeed, with x 2 > — 1, 



T(x) = 



x 2 2 /2 + xi+x 2 + l ifx 1 >l-x%/2 
2^Jx\j1 + X\ + x 2 if xi < 1 — x\j2 and 
X \ > —xl/2siga(x 2 ) 
2\jx\j2 — xi — x 2 if xi < — x\j2 sign(x 2 ). 



Consider the initial condition x = (—0.5,-0.8) for which T(x) = 2.6111. To apply the 
moment approach, set h(x(t),u(t)) = 1 and H(x(T)) = 0. Solving the moment problem 
for different values of the degree r yields the values 



degree 


6 


10 


U 


18 


cost 


1.3882 


2.1533 


2.5335 


2.6061 



In figure 1 the function T(x) and ip(x) obtained by solving the dual problem for r = 18 
are represented with solid and dashed lines, respectively. Notice that tp{x) approximates 
very well the value of T{x) at x = (—0.5, —0.8) but it gives a loose lower bound on the 
other points. As expected, it also gives a very good lower bound on T(x(t)) at every point 
x(t) of an optimal trajectory from x(0) = x. 



In the next section, the moment approach to optimal control is extended to obtain good 
approximations of the value function on a larger set which will be used later to obtain a 
good control law from the knowledge of an optimal solution (p of (12). 



5 Approximation of the value function on a set 



Consider the dual problem (12). From the cost (p(0,x(0)) to be maximized, the optimal 
value function <p(t, x) is guaranteed to be a good approximation of T(x) only at all points 
(t,x(t)) of an optimal trajectory from x(0) = x. Ideally, for control synthesis purposes, 
we would like to enlarge the set where ip is a good approximation to a region that contains 
an optimal trajectory (t,x(t)) from x(0) = x. To do this, the key observation is that the 
trajectory moment constraint (6) is valid for any initial occupation measure \ii and not 
only for /!/ := 5 X . 
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Figure 1: T{x) (solid line) and (p(x) (dashed line) from example 1. The graph is obtained 
for X2 = —0.8 and r = 18. 



Indeed, let /i/ be a probability measure for which we know how to calculate its moment 
vector . The following equality holds 

(ArtZv)'^ = J ip(0, x)dfjij. 

For instance, if [ij is a uniform probability measure on S 

J cp(0, x)d/j,i — J cp(0,x)dx. 

In this case, the solution ip(t,x) of the optimization problem minimizes the C\ norm 
f s \<p(0,x) — (f(0,x)\dx where (p(t,x) is the optimal value function for the problem. This 
can be easily verified by using the fact that J" s |</?(0, x) — ip(0,x)\dx = f s (p(Q, x)dx — 
f s ip(0, x)dx. 



Example 2 Consider Example 1. Figure 2 displays T(x) (solid line) and (p(x) when S 
is the line segment [(—1,-0.8) — (—0.5,-0.8)] (dotted line) and when S is the segment 
[(-1, -0.8), (1, -0.8)] (dashed line). 



From figure 2 one can make the following observations: 



• There is a trade-off between the accuracy of the approximation of the value function 
and the size of the set considered for the approximation; 
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Figure 2: Example 2: T(x) (solid line), f(x) with S being the line segment [(—1, —0.8) — 
(-0.5,-0.8)] (dotted line) and the line segment [(-1,-0.8) - (1,-0.8)] (dashed line). 
The graph is obtained for x 2 = —0.8 and r = 18. 



• As the approximating function ip is a polynomial, it is difficult to obtain a good 
approximation of the optimal value function T(x) at a point x where it is not 
different iable. 



While the first fact is quite normal, the second deserves more attention. The quality 
of the approximation and the improvement when increasing the degree of the test func- 
tions r depends on the specific optimal control considered: if the value function to be 
approximated is smooth the moment approach performs better. 



6 Control synthesis 



As already observed, when an optimal solution of the SOS problem is close to the optimum 
value, the (non-negative) integrand on the right hand side of equation (15) takes small 
values along an optimal trajectory (0 when the HJB condition is satisfied). Therefore, 
given an optimal solution ip of the SOS relaxation (12), a natural control law candidate 
u(x(t)) is a global minimizer of 



mm 

u€U(t,x) 



dip(x, t) 
dt 



f V x (/?(x, t)f(t, x, u) + h(t, x, u) 



(16) 
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Suppose does not depend on t (i.e. gx^Xju)) and define a box S x around x and 
contained in the set {x : 3u : gx (x,u) < 0, Vj}. The control algorithm we propose is 
the following: 



1. Set x = x(t). 

2. Calculate the moments corresponding to a uniform probability measure on 

3. Solve the moment relaxation to the OCP. 

4. Apply the control obtained by minimizing (16) until x(t) Sx- 

5. Go to step 1. 



First we show how the control strategy can be applied to the double integrator with state 
and input constraints. 

Example 3 The set U(x) of feasible controls is the interval [—1,1] when x 2 > — 1 and 
[0, 1] when x 2 = —1. Indeed, when the trajectory constraint is active (grix) = x 2 + 1 = 0), 
an admissible trajectory must be such that gr(x) > and therefore u > 0. As f is affine 
in u and the cost does not depend on u, the control is easily obtained by checking the sign 
ofV x ip(x)[0 if. 

Figure 3 displays the trajectory obtained from the initial condition x(0) = (1, 1) with the 
choice S s = {x : \x — x\ < 0.05, x 2 > —1}- The simulation stopped after 700 iterations 
with a time of 3.5 seconds to reach a circle of radius 0.01; this is exactly the minimum 
time required to reach the origin when calculated using T(-). 

In some cases, like in the next example, the approximate value function can be computed 
once and for all at time t = 0. It can also be proved that the resulting control law is 
indeed stabilizing. 

Example 4 Consider the nonlinear system 



~ xl (t) 




~x 2 (t) 


- X1 (tf + X1 (t) 2 ' 


x 2 {t)_ 






u(t) 



(17) 



already considered in [7]. The objective of the optimal control problem with free terminal 
time consists of driving to the origin the initial states from the set (xi,x 2 ) G S = [—1, 1] x 
[—1,1] by minimizing the cost functional J Q T h(x(t),u(t))dt with 

u(t) 2 

h(x(t),u(t)) = Xl (t) 2 + x 2 (t) 2 + (18) 
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0.5 1 

x i 



1.5 



Figure 3: Example 3. Trajectory of the controlled system. 

From Remark 3, the approximated value function if (computed with \xj a known initial 
probability distribution on S) does not depend on t. As f is affine in u and h is quadratic 
in u, the control law u(x) can be obtained by the first order optimality conditions: 

u(x):=-^7 x <p(x)[0 !] T - 

As this OOP has no analytic solution, we evaluate the control performance by simulating 
the closed-loop system considering several initial conditions and then evaluating 

2(UB -LB) 

W = UB + LB < 19 > 

where LB is the lower bound on the cost given by the moment relaxation and UB is 
the upper bound given integrating the cost during the simulations. The initial conditions 
considered are on the boundary of S and are represented in Figure 4- For such values of 
x(0) the trajectories converge to the origin considering test functions of degree r > 6. In 
the next table, the maximal value of the gap for the initial conditions considered is reported 
for different values of r. 



degree 


6 


8 


10 


12 


gap 


0.2275 


0.0629 


0.0577 


0.0567 



Observe that for r = 8 the performance of the control is rather satisfactory. Although 
the gap is decreasing as expected (if optimality could be reached the gap would be 0), the 
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variation is really small. This is probably due to the fact that the current SDP solvers 
have some difficulties handling even medium size problems. Figure 4 shows the trajectories 
obtained for r = 10. 




-1 -0.5 0.5 1 

x i 

Figure 4: Example 4. Trajectories obtained for r = 10. 

Using the moment approach we can also show that the proposed control drives the state 
to the origin for every initial condition in S. Indeed, we consider the controlled system 
and the following problem: maximize J Q Xi(t) 2 + X2(t) 2 dt under the constraint Cj = S (all 
the occupation measures and T are undetermined) . By solving this problem for r > A, 
we found an upper bound on the cost. Since by linearization we can verify the origin is a 
local attractor, this upper bound implies that every trajectory starting from S reaches the 
origin. 



7 Conclusions 



This paper is a follow-up to [4, 5] where a sequence of lower bounds were derived for the 
optimal value of a polynomial optimal control problem (OCP), following an occupation 
measure approach. In the current paper, we propose some techniques to constructively 
derive a control law from the solution of the convex linear matrix inequality (LMI) relax- 
ations of the OCP. So our contribution can be seen as an extension to synthesis of the 
performance analysis results of [4, 5]. 
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Generally speaking, we believe that the moment formulation of OCP is an appealing alter- 
native to indirect methods based on Lyapunov or Hamilton- Jacobi-Bellman techniques. 
The moment formulation deals directly with systems trajectories. The resulting primal 
LMI moment problem admit a dual LMI sum-of-squares (SOS) formulation which is, how- 
ever, instrumental to the explicit computation of a control law. In this context, the nice 
interplay between functional analysis (measure theory) and algebraic geometry (repre- 
sentation of polynomial positive on semialgebraic) may provide constructive answers to 
potentially difficult control synthesis problems. 

Current limitations of the approach are as follows. 

First, as we are seeking a polynomial value function (a smooth subsolution of the Hamilton- 
Jacobi-Bellman equation) that approximates the (possibly non-smooth) optimal value 
function <p(t, x), it may happen that precision deterioriates at points where <p(t, x) is non- 
smooth. Partitioning of the state-space, and/or iterative computation of the polynomial 
value function in a neighborhood moving along optimal trajectories (like in example 3) 
could help address this issue, at the price of an increased computational burden. 

Second, we are relying on the performance of current available general-purpose SDP 
solvers. Semidefinite programming is a relatively young research field, and the degree of 
maturity of SDP solvers is far from that of, say, linear or convex quadratic programming 
solvers. More specifically, as far as we know, there is currently no numerically stable SDP 
solver, and no tractable estimate of the conditioning of an LMI problem. For example, 
it is expected that the choice of a basis to represent polynomials and moments has a 
significant impact on the problem conditioning, and hence on the numerical behavior of 
the solvers. 

Third, the number of variables and constraints in the LMI problems grows quickly as a 
function of the number of state and input variables and the degree of the polynomial ap- 
proximation of the value function. Current general-purpose SDP solvers can deal with a 
few thousands variables and constraints, well below the dimensions of moment LMI prob- 
lems corresponding to OCPs with, say, 6 states and 2 inputs. For these reasons, dedicated 
primal-dual interior-point methods tailored to the specific quasi-Hankel or quasi- Toeplitz 
structure of moment LMI problems would be welcome. 

Finally, we are currently working on a user-friendly OCP module for GloptiPoly 3 [1], 
that helps formulating explicitly an OCP as a generalized problem of moments. The user 
only provides the polynomial data of the OCP, and the module automatically generates 
an approximate optimal control law. Once it is ready and fully documented, the software 
will be freely available for download from the GloptiPoly 3 webpage 

www . laas . f r/~henrion/sof tware/gloptipoly3 
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